Incorporating Concreteness in Multi-Modal Language Models with Curriculum Learning

Authors

Abstract

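Over the last few years, there has been an increase in studies that consider experiential (visual) information by building multi-modal language models and representations. Several studies have shown that language acquisition in humans starts with learning concrete concepts through images and then continues with learning abstract ideas through text. In this work, the curriculum learning method is used to teach the model concrete and abstract concepts through images and their corresponding captions, in order to accomplish multi-modal language modeling and representation. We use BERT and Resnet-152 models on each modality and combine them using attentive pooling to perform pre-training on a newly constructed dataset, which is collected from Wikimedia Commons based on concrete/abstract words. To show the performance of the proposed model, downstream tasks and ablation studies are performed. The contribution of this work is two-fold: a new dataset is constructed from Wikimedia Commons based on concrete/abstract words, and a new multi-modal pre-training approach based on curriculum learning is proposed. The results show that the proposed multi-modal pre-training approach contributes to the success of the model.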
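The curriculum idea in the abstract (concrete concepts first, abstract ones later) can be illustrated with a minimal sketch. This is not the authors' implementation: the `Sample` record, the `concreteness` rating scale, and the `curriculum_stages` helper are all hypothetical names introduced here, and the sketch assumes each image/caption pair carries a numeric concreteness rating for the word it was collected for.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Sample:
    word: str            # word the image/caption pair was collected for
    caption: str         # caption text paired with the image
    concreteness: float  # hypothetical rating; higher means more concrete


def curriculum_stages(samples: List[Sample], num_stages: int) -> List[List[Sample]]:
    """Split samples into stages ordered from most concrete to most abstract."""
    if num_stages < 1:
        raise ValueError("num_stages must be at least 1")
    if not samples:
        return []
    # Most concrete samples come first, so early stages see concrete concepts.
    ordered = sorted(samples, key=lambda s: s.concreteness, reverse=True)
    stage_size = -(-len(ordered) // num_stages)  # ceiling division
    return [ordered[i:i + stage_size] for i in range(0, len(ordered), stage_size)]


# Illustrative data only; the ratings below are made up for the example.
samples = [
    Sample("freedom", "a crowd waving flags", 1.5),
    Sample("apple", "a red apple on a table", 5.0),
    Sample("idea", "a person sketching on a whiteboard", 1.6),
    Sample("dog", "a dog running on grass", 4.9),
]
stages = curriculum_stages(samples, num_stages=2)
stage_words = [[s.word for s in stage] for stage in stages]
```

With this ordering, a pre-training loop would iterate over `stages` in sequence, so the model is exposed to concrete image/caption pairs before abstract ones. How the paper actually schedules its stages is not stated in the abstract.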

Similar resources

Incorporating active learning into a traditional curriculum.

During the past three academic years, "self-learning exercises" (SLEs) have been incorporated into the Medical Physiology course for first-year students at the Morehouse School of Medicine. Roughly 20-30% of the material covered in the course is presented to the students in the form of these exercises, instead of in lectures. The exercises are intended to help the students develop skills in act...

Learning Multi-modal Similarity

In many applications involving multi-media data, the definition of similarity between items is integral to several key tasks, e.g., nearest-neighbor retrieval, classification, and recommendation. Data in such regimes typically exhibits multiple modalities, such as acoustic and visual content of video. Integrating such heterogeneous data to form a holistic similarity space is therefore a key cha...

Incorporating E-learning in teaching English language to medical students: exploring its potential contributions

Background: The spread of technology has influenced different aspects of human life, and teaching and learning are not exceptions. This study aimed to examine the potential contribution of the use of technology in teaching English language to medical students. Methods: This qualitative-action research study was conducted in Birjand University of Medical Sciences (BUMS), with 60 medica...

Exploiting Multi-modal Curriculum in Noisy Web Data for Large-scale Concept Learning

Learning video concept detectors automatically from the big but noisy web data with no additional manual annotations is a novel but challenging area in the multimedia and the machine learning community. A considerable amount of videos on the web are associated with rich but noisy contextual information, such as the title, which provides weak annotations or labels about the video content. To lev...

Learning Multi-modal Control Programs

Multi-modal control is a commonly used design tool for breaking up complex control tasks into sequences of simpler tasks. In this paper, we show that by viewing the control space as a set of such tokenized instructions rather than as real-valued signals, reinforcement learning becomes applicable to continuous-time control systems. In fact, we show how a combination of state-space exploration an...

Journal

Journal title: Applied Sciences

Year: 2021

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app11178241